Defining the Goals to Optimise Data Mining Performance
Abstract
In many data mining problems the definition of what structures in the database are to be regarded as interesting or valuable is given only loosely. Typically this is regarded as a source of ambiguity and imprecision. However, we propose taking advantage of the looseness of the definition by choosing a particular definition which optimises some additional criterion. We illustrate using a consumer credit data set, where the definition of what constitutes a bad risk customer is somewhat arbitrary. Instead of adopting the common strategy of arbitrarily choosing some definition, we choose the one which optimises predictability. That is, among the admissible definitions, we define our classes to be those which can be predicted most accurately.
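As a rough illustration of the idea in the abstract, the sketch below tries several candidate definitions of a "bad" customer and keeps the one whose classes are best separated by a predictor. Everything specific here is an assumption made for illustration and is not taken from the source: the synthetic customers, the hypothetical `risk` score, the arrears cut-offs used as candidate definitions, and the use of AUC as the measure of predictability.

```python
import random


def auc(scores, labels):
    """Probability that a random positive outscores a random negative.

    Ties count as 0.5. Requires at least one example of each class.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be non-empty")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def make_customers(n=400, seed=0):
    """Synthetic (risk, months_in_arrears) pairs; purely illustrative."""
    rng = random.Random(seed)
    customers = []
    for _ in range(n):
        risk = rng.random()  # hypothetical score known at application time
        # Months in arrears out of 6, loosely driven by the risk score.
        arrears = sum(rng.random() < 0.5 * risk for _ in range(6))
        customers.append((risk, arrears))
    return customers


def best_definition(customers, candidate_cutoffs):
    """Return the cut-off k whose classes ("bad" = arrears >= k) are most
    predictable from the risk score, together with all the AUC values."""
    scores = [risk for risk, _ in customers]
    results = {}
    for k in candidate_cutoffs:
        labels = [1 if arrears >= k else 0 for _, arrears in customers]
        if len(set(labels)) < 2:
            continue  # skip definitions that leave one class empty
        results[k] = auc(scores, labels)
    best = max(results, key=results.get)
    return best, results


customers = make_customers()
best, results = best_definition(customers, [1, 2, 3, 4])
for k in sorted(results):
    print(f"bad = arrears >= {k}: AUC = {results[k]:.3f}")
print("chosen definition: arrears >=", best)
```

AUC is used here rather than raw accuracy because accuracy would favour definitions that make one class very rare; any other balance-insensitive measure could be substituted.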
Similar resources
Diagnosis of diabetes by using a data mining method based on native data
Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis can prevent irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...
Efficiency concerns in Privacy Preserving Association Rule Mining - Optimization of algorithm MASK
An interesting new direction for data mining research is the development of techniques that incorporate privacy concerns. Being an emerging field, major concentration so far has been on defining the metrics of privacy and establishing the technical feasibility of development of accurate models about aggregated data while meeting the goals of privacy. Thus the goal of the research in privacy pre...
One Scan is Enough: Optimising Association Rules Mining
Data mining, as a new area of research, has taken its place as one of the most important techniques in the decision-making process. Mining association rules is a simple yet powerful technique in the data mining process. The problem of mining association rules consists of finding the large itemsets and generating the association rules from these itemsets. Usually the dataset must be sca...
Combining fuzzy RES with GA for predicting wear performance of circular diamond saw in hard rock cutting process
Predicting the wear performance of a circular diamond saw when sawing hard dimensional stone is an important step in reducing production costs in the stone sawing industry. In the present research work, the parameters affecting circular diamond saw wear are defined, and then the weight of each parameter is determined by adopting a fuzzy rock engineering system (Fuzzy RES) bas...
An Integrated DEA and Data Mining Approach for Performance Assessment
This paper presents a data envelopment analysis (DEA) model combined with bootstrapping to assess the performance of one of the data mining algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...
Publication date: 1998